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Jay and Given-Wilson have recently introduced the Factorisation (or SF-) calculus as a minimal 
fundamental model of intensional computation. It is a combinatory calculus containing a special 
combinator, F, which is able to examine the internal structure of its first argument. The calculus is 
significant in that as well as being combinatorially complete it also exhibits the property of structural 
completeness, i.e. it is able to represent any function on terms definable using pattern matching on 
arbitrary normal forms. In particular, it admits a term that can decide the structural equality of any 
two arbitrary normal forms. 

Since SF-calculus is combinatorially complete, it is clearly at least as powerful as the more 
familiar and paradigmatic Turing-powerful computational models of A-calculus and Combinatory 
Logic. Its relationship to these models in the converse direction is less obvious, however. Jay and 
Given-Wilson have suggested that SF-calculus is strictly more powerful than the aforementioned 
models, but a detailed study of the connections between these models is yet to be undertaken. 

This paper begins to bridge that gap by presenting a faithful encoding of the Factorisation Cal¬ 
culus into the A-calculus preserving both reduction and strong normalisation. The existence of such 
an encoding is a new result. It also suggests that there is, in some sense, an equivalence between 
the former model and the latter. We discuss to what extent our result constitutes an equivalence by 
considering it in the context of some previously defined frameworks for comparing computational 
power and expressiveness. 


1 Introduction 

Mathematical models of computation are useful in studying the formal properties of programming prac¬ 
tice. Indeed, the field of computing today arose partly out of the study of such abstract models: namely 
Turing Machines |[33l . the A-calculus ||3, and Combinatory Logic lITOl . which are consequently consid¬ 
ered to be archetypal computational models. It is standard practice to qualify the abilities, or expressive¬ 
ness, of a formal model of computation by demonstrating that it may simulate (and be simulated by) the 
operation of other formal models. This is the very essence of the notion of Turing-completeness, which 
encapsulates the intuition that a model may carry out any operation that is ‘effectively computable’. To 
construct such a simulation one must first give an injective mapping, showing how the terms of the source 
model may be represented by terms of the target. For example, this is the basis behind the process of 
Gddelization and the Church encoding of natural numbers |[22l . Two basic properties are then required: 
that each atomic operational step of the source model is reflected by one or more steps of the target, and 
that a program of the target model terminates whenever the corresponding source program does. The 
former is a key ingredient of Landin’s influential work on comparing languages 1251 . while the latter is 
used as a criterion for comparing expressiveness by, e.g., Felleisen lfT2ll . Formal definitions of encodings 
incorporating these properties, referred to as “faithful”, are already in use by the 90s, e.g. in lJ23, and are 
now common (see e.g. ifTSl l. 
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Recently, Jay’s work on formal models of generic pattern matching ifTTl have led, in collaboration 
with Given-Wilson, to the formulation of the Factorisation Calculus. This is a combinatory calculus 
comprising two combinators: the S combinator, familiar from Combinatory Logic; and a new F combi- 
nator. The purpose of the latter is to enable arbitrary (head) normal terms to be factorised, that is split 
into their constituent parts, thereby allowing the examination of the internal structure of terms. This en¬ 
dows Factorisation Calculus with an interesting and powerful property: that of structure completeness. 
This means that any function on terms themselves definable by pattern matching over arbitrary normal 
forms is representable. Thus, Factorisation Calculus can be viewed as a minimal, fundamental model 
characterising not only the abstract notion of pattern matching but also intensional computation. 

Jay and Given-Wilson show that Factorisation Calculus is Turing-complete, demonstrating a straight¬ 
forward simulation of Combinatory Logic in their calculus. Moreover, due to its structural completeness, 
there is a term of Factorisation Calculus which can decide the structural equality of any two arbitrary nor¬ 
mal forms. Conversely, factorisation and structural equality of normal forms cannot be so represented in 
A-calculus and Combinatory Logic, thus these models are not structure complete. This hints at some sort 
of disparity in the expressivity of the two models. In their original and subsequent research |[T^IT4l[T9l . 
Jay and Given-Wilson speculate that the added expressive power may manifest itself in a non-existence 
result for simulations of Factorisation Calculus in A-calculus, but this is not pursued in detail. 

We show that there does exist a simulation of Factorisation Calculus within A-calculus. The existence 
of such a simulation has not been demonstrated before, and this is the primary contribution of our paper. 
The simulation is made possible by a construction due to Berarducci and Bdhm, which shows how to 
encode a certain class of term rewriting systems within A-calculus. We show that Factorisation Calculus 
can be simulated by such a term rewriting system, whence the result follows. In the classical framework, 
our result signifies that Factorisation Calculus is no more powerful than A-calculus. Thus there appears to 
be a mismatch between our result and the structure completeness property that the standard simulation- 
based notion of equivalence does not account for. To begin to try and resolve this, we consider some 
research in the literature which refines the concept of computational equivalence and discuss how our 
result relates to this. 


Outline The rest of this paper is organised as follows. Section [3 recalls Jay and Given-Wilson’s Fac¬ 
torisation Calculus and its basic properties. Section [3 describes Berarducci and Bdhm’s construction 
for encoding so-called canonical rewrite systems within the A-calculus. In Section 0 we present our 
technical contribution: a simulation of the Factorisation Calculus in the A-calculus via this construction. 
Section[3then discusses, in light of our results, how the relative expressiveness of Factorisation Calculus 
and A-calculus may be characterised. Section la concludes and remarks on areas for future work. 

2 Factorisation Calculus 

We begin by presenting Jay and Given-Wilson’s Factoriation Calculus itself, and review its principal 
properties. Factorisation Calculus, or more accurately SF-calculufl is a combinatory calculus whose 
terms are those of the free algebra over the two-element signature containing the combinators S and 
F, which each reduce upon being applied to three arguments. The former is the familiar combinator 
from Combinatory Logic lUOll which applies its first and second arguments to duplicates of its third. 
The F combinator, on the other hand, introduces new capabilities in the form of factorisation: it is 

'We may say that any combinatory calculus that is structure complete is a factorisation calculus. 
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able to examine the internal structure of its first argument and process its second and third in different 
ways depending on whether that argument is atomic (i.e. itself a combinator) or compound (i.e. a partial 
application). To illustrate this, consider how the F combinator reduces in the following two instances: 

FSMA^^M ^{SX)MN ^NSX 

Observe that when the first argument is atomic, it eliminates its third argument and returns its second. On 
the other hand, when the first argument is compound it eliminates its second argument and factorises the 
first into its left- and right-hand constituent components, passing these separately to its third argument. 

Formally, SF-calculus is defined as follows. 

Definition 2.1 (SF-calculus ifTSl § 4]). The SF-calculus is a combinatory rewrite system over terms 
(ranged over by uppercase roman letters M, N, etc.) given by the following grammar: 

M,N::=S \ F | MN 

Terms of the form S, F, SM, FM, SMN, or FMN (i.e. partially applied combinators) are called fac¬ 
torable forms. Reduction of terms is the smallest contextually closed binary relation -^sf on terms (with 
the reflexive transitive closure denoted by satisfying: 

SMNX -^sfMX{NX) 

FOMN^sfM ifOisSorF 

F {PQ)MN -^sfNPQ if PQ E a factorable form 

Reduction of SF-calculus is confluenl, and fhe K combinator of Combinafory Logic can be repre¬ 
sented in SF-calculus by F F (also, indeed, by F S). Thus, fhere is a frivial encoding of Combinatory 
Logic in SF-calculus which preserves reduction and sfrong normalisation lfT4]| . 

The behaviour of fhe F combinator gives SF-calculus an intensional qualify: one may define higher 
order funcfions in SF-calculus which discriminate befween funcfions whose implementations are dif- 
ferenl even when fhose functions are extensionally equal (i.e. produce identical oufpufs for identical 
inpufs). For example, for any normal form X, fhe ferm /x = S (F F)X implemenfs fhe identify funcfion 
(i.e. IxM — M for all M) and fhus all such terms are extensionally equal. However, an SF-term T can 
be constructed which distinguishes them (by behaving as T Ix —>-|p X). Moreover, one can construct an 
SF-term that can decide the equality of any two arbitrary normal forms. 

The intensional behaviour of SF-calculus is formally characterised by a property called structure 
completeness, which captures the notion that every symbolic computation (i.e. Turing-computable sym¬ 
bolic function) on normal forms is represented by some term. 

Definition 2.2 (Structure Completeness lITSl § 7-8]). Let If be a confluent combinatory calculus whose 
terms include variables, with reduction relation —7>^. Define patterns to be the linear normal forms 
(i.e. containing no more than one occurrence of each variable), and matchable forms to be partially 
applied combinators. 

1. A match {U /P} of a pattern P against a term U may be defined to succeed with a substitution of 
terms for variables, or fail as follows (where Id denotes the identity function and tt) the disjoint 
union of substitutions with match failure as an absorbing element): 

{U/x} = \U/x] {A/A} = Id (if A atomic) 

{UV/PQ} = {U/P}\Li{V/Q} (if UV a compound) 

{f7/P} = fail (otherwise, ifU matchable) 
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2. A case is an equation of the form P = M where P is a pattern and M an arbitrary term, which 
defines a symbolic function ^ on terms by = G (M) if {U/P} succeeds with substitution a, 
and W{U) = U if it fails. 

3. A confluent combinatory calculus is structure complete if for every pattern P and term M, there is 
some term G such that GU — ^{U) for every term U on which ^ is defined; i.e. the symbolic 
function defined by every case is represented by some term. 

Structure completeness subsumes combinatorial completeness since Xx.M is given by the case x = M. 

Theorem 2.3 (HS] Cor. 8.4]). SF-calculus is structure complete. 

The F combinator itself represents a symbolic computation namely that of factorisation: 


,^{A,M,N) =M 
fP{PQ,M,N) =NPQ 


if A is atomic 
if P 2 is compound 


A significant (and arguably remarkable) fact is that cannot be represented in Combinatory Logic (for 
definitions of atomic and compound appropriate thereto). 

Theorem 2.4 ( ifT^ Thm. 3.2]). Factorisation of SK-combinators is a symbolic computation that is not 
representable in Combinatory Logic. 

The equality predicate on normal forms also has no representation in Combinatory Logic. Thus, 
there exist (symbolic) functions, which are clearly ‘computable’ from an empirical point of view, that 
are not (directly) representable in Combinatory Logic (there is also a similar result for A-calculus [4]). 
This result clearly points towards some form of added expressivity possessed by SF-calculus over the 
archetypal computational models. It is to this issue that we will return in Section 

3 Strongly Normalising Solutions of Equational Systems in A-calculus 

We now reiterate the interpretation result of Berarducci and Bbhm |2T|, upon which our technical contri¬ 
bution rests. Essentially, this result says that systems of equations for a particular class of term algebras 
can be given solutions in the A-calculus such that the representation of each atomic term of the algebra 
is strongly normalising thus having a normal form. Moreover, when the set of equations is interpreted as 
a rewrite system the encoding of terms preserves reduction and strong normalisation. 

We assume the usual definitions of the A-calculus without further explanation (readers may refer to 
14] for details), with A denoting the set of lambda terms, —denoting the (multi-step) j8-reduction rela¬ 
tion, and =p denoting j8-equality (i.e. the equivalence relation on lambda terms induced by j8-reduction). 
Furthermore, we also assume the familiar algebraic notion of the set Ter(r) of (L-)terms over the signa¬ 
ture (set of function symbols, each with an associated arity) £. We can then also consider the set A(r) 
of extended lambda terms (i.e. lambda terms which may contain Z-terms); notice that both Ter(r) and A 
are (strict) subsets of A(r). 

Definition 3.1 (Canonical Systems of Equations). Fix a signature L and let S’ be a set of equations 
between terms t G Ter(r). Wc say that S is canonical ifT, can be partitioned into two disjoint subsets Zq 
and'L\ (i.e. T, = 'Lo[J'L\) such that: each equation in S isoftheformf{c{xi,...,x,„),yi,---,yn)=twith 
c G Zo and f (zLi and where the variables x\,...,Xm,y\,---,yn all distinct and form a superset of the 
variables in the term t; and for each distinct pair {c,f) G Zq x Zj there is at most one equation in S of 
this form. Wb say that S is complete if for each distinct pair {c,f) G Zq x Zj there is exactly one such 
equation in S. 
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Canonical systems of equations, then, partition the signature into a set Zo of (algebraic datatype) 
constructors, and Zi of programs defined by pattern matching over the constructors on the first argument. 
Notice that any incomplete canonical system of equations can trivially be made complete by adding 
equations for the missing cases which simply project one of the function’s argument^ 

As an example of a canonical system of equations, we may observe that the usual recursive definition 
of addition over the datatype of (Peano) natural numbers is such a system: 

add (zero,v) = v add (succ (x),y) = succ (add (v,y)) 

We have a signature containing one function symbol add, one nullary constructor zero, and one unary 
constructor succ; moreover in this simple case, the equation system is already complete. In fact, every 
partial recursive function (on natural numbers) can be defined by a canonical system of equations |l5]|6l. 

Given an equational system S’ over a signature Z, we can also take it to define a term rewriting system 
on Ter(Z) by reading each equation as a rewrite rule, i.e. fi{cj{xi,x,n),y \,■ • ■ ,yn) —>■ t. We will write 
for the (one-step) reduction relation of the rewrite system defined by S in this way (i.e. the smallest 
binary relation on terms satisfying the rewrite rules and closed under substitution and contexts), and — 
for its reflexive, transitive closure (i.e. multi-step reduction). Ultimately, the aim is to interpret equational 
systems (and their associated rewrite systems) within A-calculus. 

Definition 3.2 (Interpretations), A representation of the signature Z is a function ^ : Z —)• A from the 
function symbols ofL to (closed) lambda terms, and induces a map : A(Z) —> A in the obvious way, 
namely by x'^ = x, [Xx.M)'^ = Xx.M'^, (MN)^ = , and for f f {t[,... fn)'^ = (/>(/) 

We say that a representation (j) satisfies (or solves) S if for each equation t\ = t 2 (and corresponding 
rewrite rule ti —> t 2 ) in S we have =j} t 2 ^ (and correspondingly also ti^ t 2 ^). When a represen¬ 
tation (j) satisfies S, we say that (j) is an interpretation (or a solution) of S within X-calculus. 

The following construction gives a special kind of representation for canonical systems of equations. 

Definition 3.3 (Canonical Representations). Let S be a canonical system of equations that partitions 
the signature Z into constructors Zq = {ci,... ,Cr} and programs Zi = {f\,... ,ff\. Without loss of 
generality we may assume that S' is complete, and so for each 1 <i <k and \ < j <r let bqp denote 
the term t such that fi{cj{x\,Xm),y \, ■ • ■ An) =tGS. 

We will make use of the following notational abbreviations: 

- Let {ti,...,tn) denote the Church n-tuple, i.e. Xx.xt\ ...tn. 

- Let (where 1 < k <n) be the n-ary k‘^ projection function, i.e. Xxi ... 

- For k>i> 1, letti,...,tk,...,ti-i denote the cyclic permutation oft\,t 2 ,...,tk beginning with tp 
(in an abuse of notation we may also take ti,... ,tk,... fi-i = t\,t 2 ,... fk when / = 1). 

We now define two disjoint representations d- and for constructors and programs respectively. 

(Representation of Constructors) For each \ <i <r, we define the representation of the constructor 
Ci as follows (where n is the arity ofci): 

■d{Ci) = Xxi . ..Xnf.fWiXi ...Xnf 

(Representation of Programs) We choose k distinct fresh variables v\,...,Vk not occurring in S and 
fix a ‘pre-representation’, \j/> ofLi defined by Yifi) = (U) • ■ • Wt ,..., v,_i) /or each 1 < i < k. 

^Alternatively, one might want to introduce a new nullary constructor (denoting an ‘error’ value) and add equations for the 
missing cases that simply return this value. 
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Using this representation, and the representation of constructors defined above, we then define 
kxr lambda terms t(^jp (I <i <k, I < j <r), using the equations in S as follows: 

t^ij) = Xxi.. .XmVi.. .Vk.. .Vi-iyi.. .ynfbqj)''')^ 

where fi{cj{xi,Xm),yi ,... ,yn) = bqj'^ G S’ is the equation defining the behaviour of f when 
given a datum constructed using cj as its first argument. Wfe now define k terms, each one a 
Church r-tuple collating the bodies of all the cases for one of the programs in Zi, as follows: 

U = •,!(;/)) 

(where 1 <i <k). Each program is then represented by a Church k-tuple containing the collated 
representations of each program defintion, beginning with its own. That is, is defined by: 

Cif) = {tu... dk, ■ ■ ■ fi-i) fl <i<k) 

The representation 0 = U ^ w called a canonical representation ofL with respect to S. 

To gain some insight into the construction defined above, one can observe that it is related to an en¬ 
coding of data attributed to ScotH (and thus commonly referred to in the literature as the Scott encoding), 
which has subsequently been developed by others (e.g. ll^lZTlfT^ISn i. In the more familiar ‘standard’ 
encoding of functions, a fixed-point combinator is used to solve any recursion in the definition. This has 
the effect of making recursion explicit, and thus the representations of recursive functions have infinite 
expansions consisting of a ‘list’ of distinct instances of the function body, one for each recursive call 
that may be made. Applying the function to a datum then corresponds to a fold of the datum over this 
list, which discards the remaining infinity of recursive calls once the base case is reached. Therefore, as 
described by Bdhm et al. @ §3], in this scheme functions are ‘diverging objects which, when applied to 
data, may “incidentally” converge’. In encodings of the Scott variety, the recursive nature of functions is 
kept implicit and, while still triggered by application to a datum, only reproduced ‘on demand’. Hence 
we obtain finite objects which now ‘may “incidentally” diverge ’ wh en applied to dat£0. 

To explicate the particular encoding specified by Definition ls.sl. we point out that the representation 
of a constructor is a (lambda) function that takes in the appropriate number of arguments (the sub-data 
of the datum that is subsequently constructed) and then waits to be given a function, which will be the 
program to be executed. Now, looking at how the constructor representation uses this function argument, 
we see that programs should expect to be given a projection function, followed by a number of sub¬ 
data, and then they are also passed a copy of themselves. It is this final element which is the key to 
Scott-type encodings, and allows recursion to be kept implicit. Looking now at the representation of 
programs we see that they are Church k-tuples containing an element for each program defined by S 
(each of which is a Church r-tuple, where each element is a representation of one of the cases of that 
program’s definition). Thus the representation of each program contains the definition of every program 
defined by (f; in particular it will contain the definition of each program which it may itself invoke. To 
illustrate in more detail how the encoding works, we can consider the general reduction sequence of a 
term representing the application of some program prog, to some arguments, the first of which is a datum 
constructed as cy {di,...,dm)' 

(pi'og,-(cy(di,...,r/M))argi ... arg„,)'^ = (t,•,... (cy (r/i,... ,d„,))'^ argf ... arg^, 


^The citation can be found in Curry, Hindley and Seldin (Hip. 504]. 

‘^This reversed form of the slogan is also due to Bohm et al., and illustrates the dual nature of the Scott and Church encodings. 
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{Xx.xti 

• • • ) (Xf.fU^jdi... dmf) argf ... arg^ 

(1) 



... argi 

(2) 

^p 

tilVjdi. 

..dmtiti+\...ti-iarg\ ... 


(3) 

= 

{ti.U--- 

? U,r) ^jd\ . . . dm U .. .ti 

:_i argf . . . arg^, 

(4) 

= 

{Xx.xti^ 

1 • • • ^i,r) ^jd{ . . . dm . 

..f,_iargf ... arg^ 

(5) 

^p 


.Crdi. ..d,ntiti+i .. .h_i argf ... arg^ 

(6) 

^p 

ti j di.. 

■d,ntiti+i...ti_iarg\ ...argi 

(V) 

= 

{Xxi... 

XmVi...Vk...Vi-iyi...yn. 

ib{ij)'*'y)di. ..dmtiti+i. ..ti^i argf . 

• • argt 

~^p 

{bqp [di/x\,...,dm/x,n,argjyi, 


(8) 


When the program is applied to a datum (Eq. ([l])), its representation arranges to apply the datum first to 
the representations of each program beginning with its own, and then to the remainder of the arguments 
(Eq. (l2b)- Then, the particular structure of the datum will reduce the expression to pick out the appropriate 
case of the program definition to be executed, and apply it to the sub-data and the representations of 
each program, having duplicated the program being executed (Eqs. (Q) to 13)). This then reduces to the 
representation of the appropriate substitution instance of the function body (Eq. ([3)). 

The result of Berarducci and Bdhm says that a canonical representation gives an interpretation that 
also preserves strong normalisation. 

Theorem 3.4 (Interpretation Theorem ID Thm 3.4]). Let E be a signature and 8 a canonical set of 
equations for Z; then any canonical representation (j) for Z with respect to S' is an interpretation of S 
within X-calculus. In addition (•)'^ preserves strong normalisation of closed terms. 


4 Encoding the Factorisation Calculus 


In this section, we present our novel technical contribution: an encoding of SE-calculus in A-calculus. 
We believe that this is the first such encoding presented in the literature. Our encoding is a faithful 
simulation; it preserves both the reduction behaviour of terms (thus also j8-equality) and their termination 
behaviour (i.e. strong normalisation). In this section we shall make use of standard notation and results 
for term rewriting systems, details of which may be found in If24]l . 

The key step to the encoding is to define a rew rife sysfem behaviourally equivalenf fo SE-calculus fhaf 


is also canonical, in fhe sense of Definition l3.ll . If is fhen simply a mailer of applying Ihe conslrucfion 
of Berarducci and Bohm lo oblain fhe encoding. Thus if is our franslalion of SE-calculus info fhis 
infermediafe rewrife sysfem fhaf is fhe primary novelly of our confribufion. 

We are aiming lo derive a sel of rewrife rules fhaf is canonical and so we musf Iranslale fhe schematic 
definilion of Jay and Given-Wilson, as presenfed i n Se ction [3, info one consisling of algebraic rewrife 
rules. There are Iwo salienf fealures of Definilion l3.ll fhaf we musf lake info accounf: fhaf fhe rewrife 
rules musf make a dislinclion befween programs and constructors', and fhaf fhe lefl-hand side of each 
rewrite rule musf conlain exaclly one program symbol and one conslrucfor. To oblain rewrite rules of 
fhe required form, we recasl SE-calculus as a curryfied, applicative lerm rewriting sysfem. Thai is, we 
firs! infroduce an explicif program symbol app fo denote application and use fhe symbols S and F solely 
as constructors. Secondly we slralify fhe combinalors C G {S,F} info sels {Co,Ci,C 2 } of conslruclors, 
each of which represenl successive partial applications of Iheir underlying combinalor C. Allhough fhis 
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‘currying’ process is well-known from the world of functional programming, the reader may refer to 
121 for a formal definition of this process in the context of general term rewriting. 

We may take the rewrite rules for the S combinator directly from the standard curryfied applicative 
formulation of Combinatory Logic (see e.g. l3|): 

app (So, x) ^ Si (x) app (Si (x), y) ^ S 2 (x, y) app (S 2 (x, y),z)^ app (app {x, z), app (y, z)) 

The rules for producing the partial applications of the F combinator are similarly straightforward: 

app(Fo,x) ^ fi{x) app(Fi(x),y) ^ F2(x,y) 

The rewrite rule for the full application of the F combinator is more tricky because we must find a way of 
implementing its two possible reductions. As in the original formulation of SF-calculus, since the choice 
of which reduction to make is determined by the structure of the first argument we should like to be able 
to use the pattern-matching capabilities inherent in the term rewriting discipline, e.g. by giving rewrite 
rules such as: 


app(F2(So,y),z) ^y 


app (F 2 (Fi (x), y), z) ^ app (app (z, Fq), x) 


However these rules are not canonical since they contain two occurrences of a constructor: they are 
pattern-matching ‘too deeply’. We can circumvent this by introducing an auxiliary program symbol 
f-reduce and then having the rewrite rule for the F 2 case of app delegate to this new program: 

app(F2 (x, y), z) ^f-reduce(x, y, z) 

Since f-reduce is an independent program symbol, and only needs to pattern match on its first argument 
to determine which result to compute, we may give canonical rewrite rules for it, such as the following: 

f-reduce (So, y,z) -^y f-reduce (Fi (x), y, z) ^ app (app (z, Fo),x) 


We now have all the components to be able to present a canonical rewrite system that faithfully 
implements SF-calculus. 

Definition 4.1 (Currified Applicative SF-Calculus). Let Lsp = Lq U Zi be the signature comprising the 
set Zo = {So,Si,S 2 , Fq, Fi, F 2 } of constructors and the set Zi = {app,f-reduce} of programs. Curryfied 
Applicative SF-calculus is the term rewriting system SF^ defined by the rewrite rules given in Figure\^ 
over the signature Lsf- We denote its one-step and many-step reduction relations by and — 


Notice that the rewrite rules of SF@ are canonical, in the sense of Definition l3.ll. and that they are also 


complete. We also remark that SF@ is an orthogonal term rewriting system 112^ Def. 2.1.1]. 

It is interesting to observe that our implementation of SF-calculus as a canonical applicative term 
rewriting system has involved an application of the Visitor design patterrH |[T3l . When it comes to 
reducing a complete application of the F combinator, the computation must proceed based on the par¬ 
ticular identity of some object (i.e. the first argument), but without having any knowledge of that iden¬ 
tity. The solution is to apply the visitor pattern, which involves invoking a new ‘visit’ operation (that 
we call f-reduce) on the object, which in response executes the appropriate behaviour based on its self- 
knowledge of its own identity. We do not think it is entirely coincidental that the visitor pattern has arisen 
in our work: its connection with structural matching has already been noted |[2^ . and investigating this 
connection further is an avenue for future research. 

There is a straightforward translation from SF-calculus to SF@. 


^In fact, the encoding that we are presenting in this paper arose as a direct result of considering how the Factorisation 
Calculus could be implemented in (Featherweight) Java. 
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app(So,^) ^ Si (x) app(Fo,x)Fi (x) 

app(Si(x), 3 ;) S 2 (x, 3 ;) app(Fi(x), 3 ;) ^ F 2 (x, 3 ;) 

app ($2 (x, 3 ^), z) ^ app (app (x, z), app (y, z)) app (F 2 (x, 3 ^), z) ^ f-reduce (x, 3 ;, z) 

f-reduce(So,3', -z) ^^3^ f-reduce (Si (x), 3^, z) ^ app (app (z, So),x) 

f-reduce (Fo,3',z) ^> 3 ^ f-reduce (Fi (x), 3^, z) ^ app (app (z, Fo),x) 

f-reduce (S2 (p,q),y, z) ^ app (app (z, app (So,;?)), ^) 
f-reduce (F2 (p,q),y, z) ^ app (app (z, app (Fq, ;?)), ^) 

Figure 1: A Complete Set of Canonieal Rewrite Rules for Currified Applieative SF-ealeulus 

Definition 4.2 (Translation of SF-ealeulus to SF@), The translation [[•]](»/rom SF-terms to Sf^@-terms 
is defined by [[S]]@ = So, [[F]]@ = Fo, and [[MA]]@ = app([[M]]@, [[A^]]@). 

We now show that SF@ faithfully implements SF-ealeulus. 

Lemma 4.3 ([[-I]® Preserves Reduetion). LetM and N be SF-terms; ifM — then [[M]]@ —)■* ^ [[A^]]@- 

Proof. It is sufficient to consider the basic reduction rules of SF-calculus. In the interests of clarity, we 
underline the redex that is eontraeted at eaeh step. The ease for S is straightforward: 

[[SMAA]]@ = app (app ( app (So, [[M]]@) , [[A^]]@), [[X]]@) ^3^1 app ( app (Si ([[M]]@), [[A^]]@) , [[X]]@) 
-^sfI apP(S2([M@, [[^]]@)^ M@) ^spf aPP(aPP([M@, app([[A^]]@, M@)) 

= lMXiNX)]]@ 

The ease for F with S the first argument (i.e. atomie) is as follows (the other atomie ease is symmetrie): 

[[FSMA]]@ = app(app( app(Fo, Sp) , [[M]]@), [[A]]@) app( app(Fi (Sp), [[M]]@) , [[A]]@) 

^sf| app(F2(So, [[M]]@), IN]]@) f-reduce(So, [[A]]@) ^3^1 [[M]]@ 

When the first argument to F is a faetorable form, we must further eonsider its strueture. The ease for 
when the first argument is SA (for some term X) is as follows: 

[[F(SA)MA]]@ = app(app(app(Fo, app(So, [[A]]@) ), [[M]]@), [[A]]@) 

^sf| app(app( app(Fo, Si([[A]]@)) , [[M]]@), [[A]]@) 

^sf| app( app(Fi(Si([[A]](a>)), [[M]](a>) , [[A]]@) ^3^1 app (F 2 (Si ([[A]]@), [[M]]@), [[A]]@) 

-^sFg f-reduce (Si ([[A]](a), [[A]]@) ^sfI app(app([[A]]@, So), [[A]]@) = [[ASA]]@ 

Again, the ease for when the first argument is FA (for some term A) is symmetrie and ean be obtained 
from the above sequenee by replaeing eaeh oeeurrenee of Sp by Fp and eaeh oeeurrenee of Si by Fi. 
The ease for when the first argument is SAT (for some terms A and T) is as follows: 


[[F(SAT)MA]]@ = app(app(app(Fo,app(app(So, [[A]]@), [[T]]@)), [[M]]@), [[A]]@) 
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^sf| app(app(app(Fo, app(Si ([[X]]@), [[Y]]@) ), [[M]]@), [[A^]]@) 

^sf| app(app( app(Fo, S 2 ([[X]]@, [[F]]@)) , [[M]]@), [[A^]]@) 

^sf| app( app(Fi(S 2 ([[X]]@, [[y]]@)), [[M]]@) , [[A^]]@) 

->sf| app(F2 (S2([[X]]@, [[y]]@), M(a), [[A^]]@) 

^sf| f-reduce(S 2 ([[X]]@, [[y]]@), [[M]](a, [[A^]]@) 

^sf| app(app(P]]@, app(So, [[^]]@)), [[F]]@) = [[A^(SX)y]]@ 

Once more, the case for when the first argument is FXy (for some terms X and Y) is symmetric and can 
be obtained from the above sequence by replacing each occurrence of So by Fq, each occurrence of Si 
by Fi, and each occurrence of S 2 by F 2 . □ 

To show that [[•]] @ preserves strong normalisation, we will rely on the notion of a perpetual reduction 
sequence. We recall the relevant definitions of perpetual reductions and their properties ||2T1. A term 
t is called an 00 -term (also denoted °°{t)) if it has an infinite reduction sequence. A reduction step 
t ^ s is called perpetual if oo(f) implies °o{s), that is it preserves divergence, and a reduction sequence 
is perpetual if every step t,- is perpetual. Clearly, a perpetual reduction sequence 

t t' also preserves divergence. A redex u is called perpetual if its contraction in every context yields a 
perpetual reduction step. Let n —> f be a substitution instance of a rewrite rule r (so n is a redex and t its 
r-contraction), then call the subterms of u that are those substituted for the variables in r the arguments 
of u. Such an argument is said to be erased if it corresponds to a variable that does not occur in the right- 
hand side of r. It is the case that for orthogonal rewrite systems every redex whose erased arguments are 
strongly normalising and closed (i.e. containing no variables) is perpetual ll^ Cor. 5.1]. 

We first prove a couple of auxiliary lemmas. 

Lemma 4.4. Let N be an SF-normal form, then [[A^]]@ is strongly normalising. 

Proof. We characterise the normal forms as terms taking one of the following forms: S, F, SA, FA, 
SAF or FAF, in which each subterm is also a normal form. We then proceed by induction on the size of 
terms. The base cases, i.e. when N is either S of F are trivial since then [[A]] @ is itself a normal form. For 
the inductive cases, notice that the terms Si ([[A]]@), Fi ([[A]]@), S 2 ([[A]]@, [[F]]@) and F 2 ([[A]]@, [[F]]@) 
are strongly normalising since they are head normal and by induction [[A]]@ and [[F]]@ are strongly 
normalising as A and F are by definition normal forms (smaller than N). It is then straightforward to 
show in each case that the (unique) reduction from [[A]] @ to its corresponding head normal form given 
above is perpetual since it does not erase any arguments, and SF@ is an orthogonal rewrite system. The 
result then follows. □ 


Lemma 4.5. Let X, M and A be SF-normal forms, and O an operator (i.e. either S or F) such that 
OXMN -^sfF: then ^[[[C?AMA]](a] —)■* ^ ^[[[^]]@] i^ ^ perpetual reduction sequence for any S/-@- 
term context 


Proof. We show that there is a reduction sequence \OXMN\@ [[A]]@ which contracts a perpetual 
redex at each step, from which the result immediately follows. In fact, the reduction sequences that 
witness this are exactly those that are used to show preservatio n of reduction. In each case notice that, 
in the reduction sequence demonstrated in the proof of Lemma 14.31 the only erased argument (when it 
exists) i s in the final reduction step and in each case this argument is either [[M]]@ or [[A]]@ which by 
Lemma l4.4l is strongly normalising since M and A are normal forms (and also closed since we do not 
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consider SF-terms with variables). Thus, since SF@ is an orthogonal rewrite system, it follows that the 
redex contracted at each step is perpetual. □ 

We can now prove the following result. 

Lemma 4.6 ([[-I]® Preserves Strong Normalisation). Let M be a strongly normalising SF-term; then 
[[M]] @ is strongly normalising. 

Proof. We use the same technique as used in the proof of lH Thm. 3.4(2)], and proceed by (strong) 
induction on the length n of the longest reduction sequence from M to its normal form. When n = 0, then 
we have that M is a (SF-)normal form and thus [[M]]@ is strongly normalising w.r.t ^ by Lemma Id^ . 
When n> 0 then M must contain at least one redex OXYZ. Consider an innermost redex, whose 
contractum is the term R. Thus M = '^[OXY Z] = N (for some (SF-)term context ^) and X, 

Y and Z are normal forms (since the redex is innermost). Now, since M is strongly normalising so too is 
N, and the length of its longest reduction sequence must be strictly less than n (otherwise n would not be 
maximum). Thus by the inductive hypothesis [[N]]@ is strongly normalising. Consider now the structure 
of [[M]]@: we have [[M]]@ =^'^'[[[OXFZ]]@] for some (SF@-)term context Since X, Y and Z are 
normal forms, by Lemma Id^ there is a perpetual reduction sequence from [[M]](a = ^'[[[OZFZ]]@] to 
[[N]]@ = ^^[[[f?]]@], i.e. one which preserves divergence. Therefore, since [[N]]@ is strongly normalising 
so too is [[M]](a (if it were not, neither would [[N]]@ be). □ 

Using Berarducci and Bdhm’s construction, outlined in Section we obtain an encoding of SF- 
calculus in the A-calculus. 

Definition 4.7 (Translation of SF-calculus to A-calculus). Fix a canonical representation ([isf o/Lsf 
w.r.t. the rewrite rules of Sr@. The mapping [[•]];(^ = (• translates SF-calculus to X-calculus. 

We leave it as an exercise to the reader to compute such a canonical representation 0 sf- 

We now present our main result: that is a faithful encoding of SF-calculus in A-calculus. 

Theorem 4.8 (Faithful Encoding of SF-calculus in A-calculus). The translation of SF-calculus into 
X-calculus preserves reduction and strong normalisation. 


Proof. The result follows directly from the fact that the two translations that are composed to obtain 
[[•]];^, na mely an d each satisfy both these properties. In the case of the fo rmer , we refer to 

Lemmas 43 and 43; for the latter to the results of Berarducci and Bdhm, cf. Theorem Is.dl (note that all 
terms are closed, since we do not consider SF-calculus with variables). □ 


5 Expressiveness of Factorisation: Discussion & Related Work 

We now turn out attention to the question of the expressiv enes s of SF-calculus relative to A-calculus and 
Combinatory Logic. On one hand our results, i.e. Theorem id.sl. along with those of Jay and Given-Wilson 
ca, show that SF-calculus and A-calculus simulate the same executions. On the other, SF-calculus is 
structure complete whereas A-calculus is not. Is this a contradiction and, if so, how may it be resolved? 
Notwithstanding the long tradition of using simulations to characterise computational equivalence, it 
has since been realised that a refinement of this notion is necessary to draw richer, more meaningful 
comparisons. While a complete and in-depth analysis is not within the scope of the current paper, by 
discussing our results with reference to some of this work we aim to draw some concrete conclusions 
about the how the expressiveness of structure completeness and SF-calculus may be characterised. 
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The ‘Standard’ Notion of Equivalence. From the earliest research into computability, two aspects 
of abstract notions of computation were identified as relevant to the idea of expressiveness. Firstly, one 
wants to compare the respective set of functions that each model computes. For example, in recursion 
theory it was shown that the set of primitive recursive functions (on natural numbers) is a strict subset 
of the recursive functions (see e.g. |{32]|). At the same time, there is a requirement to compare models 
that operate, at a fundamental level, in diverse domains. Turing Machines, A-calculus and Combinatory 
Logic are, operationally, quite different ways to compute. It was shown however that via particular 
(and now canonical) representations of numbers, each model can ‘compute’ the same set of functions 
on natural numbers, namely the partial recursive functions. Conversely, Gddelization allows each of the 
computations in these models to be simulated by a partial recursive function Ii22l . 

This idea of simulation extends to the operation of the models themselves: each model may simulate 
the operational behaviour of the others. Such simulations also appear to abstract away the problem of 
representation: often we do want to compute functions of natural numbers, however more often we 
desire to compute functions over different domains; it is incumbent upon us to represent elements of the 
desired domain of discourse as terms of the computational model. The initial characterisation of the set 
of ‘computable’ functions was over the domain of natural number^ but what is the set of ‘computable’ 
functions over some other given domain? With simulations between models it seems that one may at 
least lay this question aside by observing that whatever can be represented in one model may then also 
be represented in the other, and therefore whatever functions they do compute it is the same in both 
cases. A stronger conclusion would be that the set of computable functions over arbitrary domains is 
isomorphic to the computable functions on natural numbers. 

This simulation method has long since become the standard: to show two models are of equivalent 
computational power, demonstrate simulations of each in the other. One model of computation is only 
more powerful than another, then, if it is not possible to simulate the former in the latter. The approach 
has been cemented over the years, notably in Landin’s now seminal work ll25l . As the search space of 
formal computation has been explored the notion of simulation has been adapted accordingly, and there 
are now a number of sophisticated simulatio ns be tween all sorts of models, both sequential and concur¬ 
rent (e.g. milSl). In this tradition. Theorem k.sl is a result showing that A-calculus is computationally 
as powerful as SF-calculus. 


Refining the Notion of Expressiveness. It was already noted over two decades ago that despite the 
broad applicability and application of the simulation method, it is not actually a fine-grained enough 
notion to provide a complete and universal characterisation of expressiveness. Felleisen observed that 
since the languages we wish to compare are (usually) Turing-complete, other methods (than simulation) 
must be found in order to verify claims of relative (in-)expressiveness ifT^ . He proposed a framework 
based on the concept, from logic, of eliminability of symbols from conservative extensions |j23l. One 
logical system .if is a conservative extension of another system Sf" if the expressions (formulae) and 
theorems of the latter are subsets of those of the former. A symbol of .if (which is not in Sf") is eliminable 
if there is a homomorphism (i.e. a map preserving the syntactic structure) tp : Exp(.if) —)• Exp(.if') from 
the expressions of .if to the expressions of .if', which acts as identity for expressions of .if', such that 
an expression f is a theorem of .if if and only if (p{t) is a theorem of .if'. Felleisen extends this to 
programming languages by analogy - formulae are (syntactically valid) programs and the theorems are 
the terminating programs. Then we may say that language .if is more expressive than language .if' when 

^Turing’s work is different in this respect, since he deliberately embarked on a characterisation of computable functions 
over a different domain, namely that of strings of arbitrary symbols. 
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the former adds some non-eliminable syntactic construct, i.e. one which cannot be ‘translated away’. 
Thus the standard simulation approach is refined by imposing an extra criterion when one language is a 
superset of another: can the larger one be built from the smaller one using macros? 

To place SF-calculus in this framework we can consider SKF-calculus, i.e. the extension of SF- 
calculus by including an additional atomic term: the familiar K combinator. Since SKF-calculus is a 
proper extension of Combinatory Logic, obtained by adding the F combinator, we can apply the expres¬ 
siveness test of Felleisen. Th at is , we ask is F eliminable? The answer to this question is no; indeed 
this is guaranteed by Theorem 12.4 In this sense. Jay and Given-Wilson’s results do justify the claim that 
SF-calculus is more expressive than Combinatory Logic and A-calculus. 


More Abstract Notions of Computational Equivalence. The simulation method, and its refinement 
described above, are still firmly grounded in an operational view of compulation, but recent work has 
sought to anchor formal comparisons of expressiveness in a more general, abstract basis. A notable 
contribution to this effort is the work of Boker and Dershowitz f7j. They abstract the notion of compu¬ 
tational model as simply its extension, i.e. the set of functions over its inherent domain that it computes, 
and consider simulations (encodings) between them. They derive the remarkable result that combining 
the standard simulation approach with the natural containment of one extensionality within another leads 
to a paradox: some computational models can simulate models which are strictly more powerful in the 
sense that they have larger extensionalities (i.e. compute more functions). Thus, some representations 
add more computational power. 


This result begs the question: do we consider simulations via such ‘active’ mappings to constitute an 
equivalence? One may suspect that the problem lies in allowing injective mappings between domains, 
and that imposing stricter conditions (e.g. bijections) would ensure ‘passiveness’. This is indeed the 
case, but adopting such restrictions is useless for most comparisons since the passive encodings are 
ones which are “almost identity”. We have no choice but to allow such encodings, although we do 
have the option of considering a hierarchy of equivalences based on the properties of the encodings used. 
Boker and Dershowitz define four increasingly sfricfer notions of (in-)equivalence based on, respecfively: 
injecfive encodings (power equivalence), corresponding fo fhe sfandard approach; injective encodings 
for which fhe images are compufable in fhe simulating models (decenf power equivalence); bijecfive 
encodings (bijecfive power equivalence); and bijections fhaf are inverses of each ofher (isomorphism). 
Boker and Dershowifz also show thaf fhere are models (including Turing Machines and fhe recursive 
numeric funclions) which cannof simulafe any sfronger models; fhey are interpretation complete. 

We may also gain insighf info fhe relative expressiveness of SF-calc ulus and A-calculus using fhis 
framework. Our resulf shows fhaf fhey are power equivalent. Theorem 12.4 shows fhaf fhey cannof be 
bijecfively power equivalenf (nor, fherefore, isomorphic) since fhaf would imply fhaf A-calculus could 
disfinguish arbifrary normal forms. Thus, in fhis sense fhey are nol equivalenf. We do not know if SF- 
calculus is bijectively stronger than A-calculus; this would require demonstrating a bijective encoding of 
the latter in the former. Also, we do not know if A-calculus is decently power equivalent to SF-calculus; 
we consider it at least a possibility. We would also expect that, due to its intensional capabilities, SF- 
calculus is interpretation complete. 


Related to the work of Boker and Dershowitz, is that of Cockett and Hofstra |9i|, and Longley ll2^ 
which are both concerned with category-theoretic descriptions of abstract computational models. In these 
frameworks model equivalence is interpreted by categorical isomorphism, and so akin to the strongest 
notion of equivalence considered by Boker and Dershowitz. 
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6 Conclusions & Future Work 

In this paper, we have considered the relationship of the recently introduced SF-calculus to the ‘canoni¬ 
cal’ computational model of A-calculus and, so by extension. Combinatory Logic. We have demonstrated 
that SF-calculus can be faithfully encoded (i.e. simulated) in the A-calculus by defining a behaviourally 
equivalent applicative term rewriting system and then interpreting this system in A-calculus using a con¬ 
struction of Berarducci and Bdhm. This result shows that SF-calculus and A-calculus are of equivalent 
computational power, according to the classical interpretation of computational equivalence. We have 
also considered the relationship of SF-calculus to the A-calculus using a more nuanced interpretation of 
equivalence, informed by research in the literature. Moreover, we hope to have exposed both SF-calculus 
and Berarducci and Bdhm’s encoding to greater prominence. We feel that they are both subjects of great 
interest which deserve to be better known. 

With respect to future work, there is still great scope for investigating the expressiveness of SF- 
calculus. There are the open questions we have highlighted regarding SF-calculus as it relates to the 
framework of Boker and Dershowitz. The categorical and denotational natures of SF-calculus also de¬ 
serve exploration. Beyond this, the wider question of how best to qualify computational expressiveness 
still remains; we believe the study of SF-calculus can provide further insights on this. For example, 
the structural completeness property is already a new metric; Jay and Vergara have also considered a 
strengthening of the notion of decent power equivalence in which the simulation of each model in it¬ 
self via the composition of the two encodings is computable in each model ifT^ . Quantitative questions 
regarding expressiveness also present themselves: e.g. our encoding of SF-calculus in A-calculus leads 
to a large increase in the size of terms; are there lower bounds on the size of such increases which 
meaningfully quantify expressive power? 
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